Application of Information Technology: A Statistical Approach to Scanning the Biomedical Literature for Pharmacogenetics Knowledge

Authors

  • Daniel L. Rubin
  • Caroline F. Thorn
  • Teri E. Klein
  • Russ B. Altman
Abstract

OBJECTIVE Biomedical databases summarize current scientific knowledge, but they generally require years of laborious curation to build, much of it spent identifying pertinent publications and data in the voluminous biomedical literature. It is difficult to manually extract useful information embedded in such large volumes of text, and automated intelligent text analysis tools are becoming increasingly essential to assist in these curation activities. The goal of the authors was to develop an automated method to identify articles in Medline citations that contain pharmacogenetics data pertaining to gene-drug relationships.

DESIGN The authors built and evaluated several candidate statistical models that characterize pharmacogenetics articles in terms of word usage and the profile of Medical Subject Headings (MeSH) used in those articles. The best-performing model was used to scan the entire Medline article database (11 million articles) to identify candidate pharmacogenetics articles.

RESULTS A sampling of the articles identified from scanning Medline was reviewed by a pharmacologist to assess the precision of the method. The authors' approach identified 4,892 pharmacogenetics articles in the literature with 92% precision. Their automated method took a fraction of the time that would be expected to accumulate these articles manually. The authors have built a Web resource (http://pharmdemo.stanford.edu/pharmdb/main.spy) to provide access to their results.

CONCLUSION A statistical classification approach can screen the primary literature for pharmacogenetics articles with high precision. Such methods may assist curators in acquiring pertinent literature when building biomedical databases.
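The abstract does not say which statistical model performed best, so the following is only a minimal sketch of the general idea: a classifier that scores an article from its words and its MeSH headings, followed by a precision estimate from a reviewed sample. Naive Bayes with add-one smoothing is an assumption chosen for brevity, not the authors' confirmed method. The class name `NaiveBayesScreen`, the helper names, the four toy training records, and the review counts are all hypothetical.

```python
import math
from collections import Counter


def features(abstract, mesh_terms):
    """Turn an article into a bag of features: lowercased words plus MeSH headings."""
    words = [w.strip(".,;:()").lower() for w in abstract.split()]
    words = [w for w in words if w]
    # Prefix MeSH headings so they never collide with ordinary words.
    return words + ["mesh:" + m.lower() for m in mesh_terms]


class NaiveBayesScreen:
    """Hypothetical two-class naive Bayes screen (assumed model, not the paper's)."""

    def __init__(self):
        self.counts = {True: Counter(), False: Counter()}
        self.docs = {True: 0, False: 0}
        self.vocab = set()

    def fit(self, examples):
        # examples: iterable of (abstract, mesh_terms, is_pharmacogenetics)
        for abstract, mesh, label in examples:
            self.docs[label] += 1
            self.counts[label].update(features(abstract, mesh))
        self.vocab = set(self.counts[True]) | set(self.counts[False])

    def log_odds(self, abstract, mesh):
        # Start from the class prior, then add one log-likelihood ratio per feature.
        score = math.log(self.docs[True] / self.docs[False])
        v = len(self.vocab)
        n_pos = sum(self.counts[True].values())
        n_neg = sum(self.counts[False].values())
        for f in features(abstract, mesh):
            if f not in self.vocab:
                continue  # ignore features never seen in training
            p_pos = (self.counts[True][f] + 1) / (n_pos + v)  # add-one smoothing
            p_neg = (self.counts[False][f] + 1) / (n_neg + v)
            score += math.log(p_pos / p_neg)
        return score

    def predict(self, abstract, mesh):
        return self.log_odds(abstract, mesh) > 0


def precision(reviewed):
    """Fraction of flagged articles that an expert reviewer confirmed as relevant."""
    return sum(reviewed) / len(reviewed)


# Toy, made-up training records for illustration only.
TOY = [
    ("CYP2D6 polymorphism alters codeine metabolism",
     ["Pharmacogenetics", "Cytochrome P-450 CYP2D6"], True),
    ("TPMT genotype predicts thiopurine toxicity",
     ["Pharmacogenetics", "Polymorphism, Genetic"], True),
    ("Surgical repair of knee ligament injury",
     ["Orthopedic Procedures"], False),
    ("Hospital staffing and patient satisfaction survey",
     ["Health Services Research"], False),
]

model = NaiveBayesScreen()
model.fit(TOY)
print(model.predict("CYP2D6 genotype and codeine response", ["Pharmacogenetics"]))
print(model.predict("Knee injury repair survey", ["Orthopedic Procedures"]))

# Hypothetical review outcome: 23 of 25 sampled articles confirmed by the reviewer.
print(precision([True] * 23 + [False] * 2))
```

Estimating precision from a reviewed sample, as the abstract describes, avoids having to label every flagged article. Recall cannot be measured this way, because the sample contains only articles the model already flagged.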


Similar resources

A Statistical Approach to Scanning the Biomedical Literature for Pharmacogenetics Knowledge

Design: The authors built and evaluated several candidate statistical models that characterize pharmacogenetics articles in terms of word usage and the profile of Medical Subject Headings (MeSH) used in those articles. The best-performing model was used to scan the entire Medline article database (11 million articles) to identify candidate pharmacogenetics articles. Results: A sampling of the ar...


Designing Web-Based Learning with an Emphasis on Constructivist Epistemology

The current growth of philosophical and educational theories and of computer technology has provided new forms of education worldwide. The modern world has features such as communication, non-congruence, and flexibility. Therefore, the web and other multimedia technologies are merely information and application resources unless they can provide a learning field and content. The purpose of this study is reconstr...


Application of Rough Set Theory in Data Mining for Decision Support Systems (DSSs)

Decision support systems (DSSs) are prevalent information systems for decision making in many competitive business environments. In a DSS, the decision-making process is intimately related to factors that determine the quality of information systems and their related products. Traditional approaches to data analysis usually cannot be implemented in sophisticated companies, where managers ne...


Human Capital Content Analysis: General Pattern and Application for Graduates of Persian Literature

Economists use human capital as a black box in their models, regardless of the content. It does not have the necessary importance and effectiveness for policy development of higher learning and employment of higher education graduates. Therefore, this study aimed to reopen this black box and analyze its content theoretically and experimentally. To achieve this goal, first the concept of human c...


Telework in I.R.I.B and the role of information technology

The spread of information and communication technologies has led to their application in various fields, particularly teleworking. The purpose of the current study is to investigate the role of individual and organizational antecedents and attitudes in establishing employees’ tendency toward teleworking at the Islamic Republic of Iran Broadcasting. Further, the role of information technology in the issue w...



Journal title:
  • Journal of the American Medical Informatics Association : JAMIA

Volume 12, Issue 2

Pages: -

Publication date: 2005